Do All Spherical Viruses Have Icosahedral Symmetry? 
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Recent high resolution structures for viral capsids with 12, 32 and 72 subunits (Tl, T3 and T7 viruses) have 
confirmed theoretical predictions of an icosadeltahedral structure with 12 subunits having five nearest neighbors 
(pentamers) and (lOT + 2) — 12 subunits having six nearest neighbor subunits (hexamers). Here we note that 
theoretical considerations of energy strain for TA, T9, TIG and T25 viruses by aligned pentamers and energy 
strain along with the sheer number of possible arrangement of pentamers as the number of subunits grows, and 
simulations for such numbers of subunits make an icosadeltahedral configuration either miraculously unlikely 
or indicate that there must be a principle of capsid assembly of unprecedented fidelity in Nature. We predict, for 
example, that high resolution data will show TA capsids to have Dsh not icosahedral symmetry. 



More than half a century ago Crick and Watson 1 1 ] had the 
ingenious insight that viral capsids must be made of multi- 
ple units of the same small number of proteins, lest the viral 
genome be orders of magnitude too large — if coding for each 
of the hundreds or thousands of capsid proteins separately — 
to fit inside the capsid. Caspar and Klug|2] made a significant 
advance in appreciating that the structure of a number of viral 
capsids had icosahedral symmetry. They described capsids by 
a number T = + + ab (a, b non-negative integers) hav- 
ing N ~ lOT + 2 subunits arranged into an icosadeltahedral 
lattice. However, determination of the structure of these large 
capsids is a tour de force of experimentation, and until recent 
high resolution studies confirming icosadeltahedral configu- 
rations for riH , T3|ll and TtJIIsIi viruses, the structures 
were typically determined by fitting relatively low resolution 
experimental data to a model of a capsid with icosahedral 
symmetry (see ref. |0] and refs. therein). There has been im- 
portant work using theory, modeling and simulations to try to 
understand how icosadeltahedral capsids form|7, 8, 9]. Here 
using J.J. Thomson's problem ifioll of the arrangement of unit 
point charges on a sphere we quantify the thermodynamic and 
kinetic properties of icosahedral configurations. We find that 
for Tl, T3 and T7 viruses icosadeltahedral capsids are fa- 
vored energetically and occur the vast majority of times in 
simulations starting from random configurations. Conversely, 
intriguingly, for N = 42, 92, 162 and 252 (T4, T9, T16, T25) 
the icosahedral configuration is not only neither the minimum 
energy nor the most commonly found in our simulations start- 
ing random configurations, and never is found in our simula- 
tions at all. Indeed, for iV = 42 (TA) the minimum energy 
configuration has D^h symmetry and occurs in 98% of runs. 
The icosadeltahedral configuration has a higher energy and is 
not found. We predict that T4, TQ, T16 and T25 virus cap- 
sids do not have icosahedral symmetry. If they are found in 
high resoultion experimental data, then novel ideas are needed 
to explain such high fidelity assembly of thermodynamically 
and statistically unfavored configurations. 

Over one hundred years ago J. J.Thomson ifioll asked the 



question of the minimum energy configuration of N unit point 
charges on (the surface of) a unit condu cting Bill Il2l Il3l \U 




sphere. Much theoretical, numerical and experimental work 
since then has made considerable progress on Thomson's 
problem yielding interesting and nonobvious results: ForiV = 
4 charges the global minimum energy configuration is the ge- 
ometrically symmetric configuration of a tetrahedron. How- 
ever, for = 8 the minimum energy configuration is not 
a cube, but rather an anticube — four charges arranged in a 
square parallel to the equatorial plane in both the Northern 
and Southern hemispheres, but with the squares rotated by 45 
degrees with respect to each other This configuration has a 
lower energy than a cube as the rotation of the squares lowers 
the energy between nearest neighbor charges between the two 
squares. The case of iV = 8 also illustrates a general phe- 
nomenon in Thomson's problem whereby the most symmet- 
ric configuration is not necessarily the configuration of mini- 
mum energy. However, symmetry considerations can also be 
a useful guide to finding minimum energy configurations. For 
12 < < 100 (and likely for the most part up to iV = 200) 
attack of Thomson's problem by multiple numerical and theo- 
retical approaches and methods has likely found the minimum 
energy configurations. In most cases there are exactly twelve 
charges with five nearest neighbors — ^pentamers and the rest 
of the charges with six nearest neighbors — hexamers. Euler's 
theorem for convex polyogons — the number of vertices plus 
faces equals the number of edges plus 2 {V + F — E + 2) — 
has the result for points on a sphere that there must be at least 
twelve pentamers with the rest of the charges being hexamers 
or pentamer/septamer pairs. Though, only for N of the form 
10(a^ + b'^ + ah) + 2 (where a > 6 > 0) is it possible for the 
twelve pentamers and the entire configuration to have icosa- 
hedral symmetry. While for TV = 12, 32, 72, 122, 132, 192, 
212, 272 and 282 (Table 1, Figure 1) the icosahedrally sym- 
metric configuration (an icosadeltahedral configuration) is the 
best known energy minimum (and the presumed global energy 
minimum configuration), for N = 42, 92, 162, 252 (and then 



2 



T numbers larger than 282) the icosadeltahedral configuration 
is not the global energy minimum. Instead configurations with 
exactly twelve pentamers, but with the pentamers arranged in 
D^hi D2, and C2 symmetries are the global energy min- 
ima for N — 42, 92, 162 and 252 respectively (Table I, Figure 
2). These symmetries of global energy minima hold not only 
for the l/r Coulomb potential, but for other representative 
electrostatic potentials as well (Table II). 

The reason that the icosadeltahedral configurations are not 
the global minima for N = 42, 92 and 162 ((2, 0), (3, 0) 
and 4, 0)) configurations is due to the energy cost of the ver- 
tices of the pentamers being nearer to each other than in non- 
icosahedral configurations. In general it seems that in seek- 
ing global energy minima for N <200 Nature uses the gen- 
eral strategies of moving and rotating pentamers. In some 
cases either because of pure geometrical constraints [15J or 
just to minimize the energy, occasionally a pentamer/ hep- 
tamer defect pair (dislocation defect in the language of elas- 
ticity) is needed to achieve a global energy minimum. As N 
grows larger — ^500 or so — important papers by Dodgson 
and Moore il2L 11311 showed that the energy strain of the pen- 
tamers which is necessitated by the topology of a sphere but 
which distort the pure hexagonal lattice that would be the en- 
ergy minimum on a flat sheet, is such that to lower the en- 
ergy pentamer/heptamer defect pairs are needed between all 
of the pentamers. Now, as N grows the number of local en- 
ergy minima, M{N) (12 < N < 112) was found to grow 
exponentially |[l3]: 



M{N) w 0.382 X exp (0.0497A^) . 



(1) 



Thus, if Nature is using an energy minimization strategy to 
find the configuration of alignment of molecules in a viral cap- 
sid it would seem that as N grows it will become increasingly 
difficult if not impossible to find the global energy minimum 
configuration. Furthermore, the number and relative depth 
and breadth of good local minima could also be a constraint 
on kinetic strategies that Nature may use to find the ultimate 
configuration. In Table 1 we show the number of times we 
found the various local minima in runs where we started the 
charges from 5000 random configurations and then used stan- 
dard conjugate gradient methods to go to a local minimum. 
We see that for = 12, 32 and 72 the minimum energy con- 
figuration and overwhelmingly the most common is in fact 
the icosadeltahedral configuration. For N = 42, 92 and 162 
however, the icosadeltahedral configuration is not only not the 
minimum energy configuration, it is not reached from random 
configurations ever. In Figure 3 we show the number of local 
minima found in our simulations (solid red circles) and those 
given by Eq. 1 versus N. For large N the number of minima 
found diverges from Eq. 1 since the number of initial runs 
is comparable to the number of local minima. Similarly, we 
found that for a potential energy of 1/ r*^^ and 1 /r'^ again the 
icosahedral configurations for N = 42, 92 and 162 are not 
global minima and virtually never occur in the simulations 
(Table II). For N —42 (TA) icosahedral and D^h symmetries 
look quite similar (Figures 2c and 4). If we only take into ac- 



count the balls' positions in Fig. 4, we can pass from the left 
model to the right one just by rotating an hemisphere by an 
angle of 27r/10. 

Based on these electrostatic model potentials we would pre- 
dict that n, T3 and T7 virus would have icosadeltahedral 
configurations as recently found experimentally (^J^J3]. T13 
viruses may have an icosadeltahedral configuration. We be- 
lieve that high resolution studies of T4 viruses will show a 
D^ii configuration in particular and not an icosadeltahedral 
configuration. We predict that T9, T16 and T25 viruses will 
not be found to have an icosadeltahedral configuration though 
we do not have a clear prediction for the structure of these 
viruses. Conversely, if a T4, T9, T16 or T25 virus were found 
to have an icosadeltahedral configuration given the essentially 
vanishing possibility of this from energetic considerations or 
statistical considerations based on electrostatic potentials, it 
would indicate a mechanical rule of assembly to be discov- 
ered that is of expontentially good precision. It is still also 
a mystery why nature seems to so prominently use capsids 
with T numbers of subunits to the exclusion of other numbers 
of subunits. The energetics and statistics are so favorable for 
N = 12, 32 and 72 (Tl, T3, Tl) 1 14, 15, 19J that protein sub- 
units consistent with these configurations must have emerged. 
The same factors would suggest a Z^s/j configuration for = 
42. A key question could be understanding the evolution of 
T16 capsids. In general, geometry and topology seem to be 
important constraints that need to be considered in viral evo- 
lution or even possible treatments for viral diseases. 
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N=42 N=92 N=162 



FIG. 2: Icosadeltahedral configuration b) Global minima 
c) Icosadeltahedral and global minima overlap. 




FIG. 3: Number of local minima found as a function of N. The 
continuous line is given by Eq. 1. 
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TABLE I: Frequencies of finding global minima and icosadeltahedral configurations. Erber and Hockney found similar frequencies using a 
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TABLE IL Frequencies of finding global minima and icosadeltahedral configurations. Data for a (l/?"^) potential. 
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